مقایسه روش‌های مختلف یادگیری ماشین در خلاصه‌سازی استخراجی گفتار به گفتار فارسی بدون استفاده از رونوشت

نویسندگان

جعفری, هدی سادات دانشگاه صنعتی امیرکبیر

همایون پور, محمدمهدی دانشگاه صنعتی امیرکبیر

چکیده مقاله:

In this paper, extractive speech summarization using different machine learning algorithms was investigated. The task of Speech summarization deals with extracting important and salient segments from speech in order to access, search, extract and browse speech files easier and in a less costly manner. In this paper, a new method for speech summarization without using automatic speech recognition system (ASR) is proposed. ASR systems usually have high error rates especially in adverse acoustic environment and for low resource languages. Our goal was to answer this question: is it possible to summarize a Persian speech without ASR using less or no training data? We have proposed a method which discovers salient parts directly from speech signal by using a semi-supervised algorithm. The proposed algorithm consists of three main stages, features extraction, identifying key patterns and selecting important sentences. First we have segmented speech voices manually into sentences to eliminate sentence segmentation errors. Therefore, we could have better comparison between different summarization methods. Then we have extracted some features from each sentence such as sentence duration, if the sentence is first or last sentence in the speech and so on. Also, repetitive patterns between each two sentence of speech are discovered directly from speech signal by using S-DTW algorithm. S-DTW algorithm can discover repetitive patterns between two speech signals by using MFCC features. By using these repetitive patterns between each pair of sentences we can make a similarity matrix. Therefore, we could measure the similarity distance between each pair of sentences and eliminate redundant sentences from summary without the need to use an ASR system After finding the similarity between each two speech segments and extracting some features from each segment, various machine learning algorithms including unsupervised (MMR, TextRank), supervised (SVM, Naïve Bayes) and semi-supervised algorithms (self-training, Co-training) are used in order to extract salient parts. Experiences are done in read Persian news. The results show that using semi-supervised co-training method and appropriate features, the performance of speech summarization system on read Persian news corpus can improve about 3% compared to selecting the first sentences and by 5% compared to longest sentences when ROUGE-3 is used as the evaluation measure.

Download for Free

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

بررسی خطاهای گفتار در زبان فارسی

تاریخچه مطالعات مربوط به خطاهای گفتار حکایت از رویکردهای گوناگون به این پدیده دارد. برخی پژوهشگران با هدف آزمردن نظریه های شناختی به بررسی جنبه ها یی خاص از این پدیده از دریچه روان شناسی زبان پرداخته اند. عد ه ا ی دیگر تلأش کرد ه ا ند تا با دید زبان شناختی به خطای گفتار انو اعی از طبقه بندی را برای آن به دست دهند. اصطلاح های گوناگونی در این حوزه رواج دارد. بررسی و مقایسه تعاریفی که از این اصطلا...

متن کامل

یادگیری توالی‌های حرکتی گفتار در بزرگ‌سالان مبتلا به لکنت

Objective Developmental stuttering is a speech disorder characterized by repetition, prolongation, block and disruption of the smooth flow of speech. Environmental, physical, mental, and cognitive-linguistic factors were involved in the initiation and development of stuttering. There have been several theories about the development of stuttering. One of these theories suggests that stuttering i...

متن کامل

نگاهی جامعه شناختی به کنش گفتار انتقاد در زبان فارسی

زبان تجلی‌گاه فرهنگ و افکار افراد یک جامعه است و با بررسی زبان می‌توان به مطالعه رابطۀ متقابل زبان، تفکرو فرهنگ دست یافت و از نظام فکری و فرهنگی یک جامعه آگاه شد. لذا پژوهش حاضر در صدد است تا به بررسی کنش‌گفتار انتقاد در زبان فارسی بپردازد. پژوهش حاضر به بررسی این راهبردها در کنش گفتار انتقاد می پردازد. بدین منظور از آزمون تکمیل گفتمان استفاده شد و پاسخ های جمع‌آوری شده از 200 شرکت‌کننده با است...

متن کامل

تأثیر پیچیدگی نحوی بر میزان ناروانی گفتار کودکان لکنتی و غیرلکنتی فارسی زبان در گفتار خودانگیخته

سابقه و هدف: پژوهش‌های متعددی به بررسی ارتباط لکنت با عوامل زبان‌شناختی مختلف پرداخته‌اند. هدف از این پژوهش، تعیین و مقایسه تأثیر پیچیدگی نحوی بر میزان ناروانی گفتار کودکان لکنتی و غیرلکنتی فارسی زبان در سطح گفتار خودانگیخته می‌باشد. نتایج حاصله می‌توانند جهت شناخت بیش‌تر ماهیت لکنت و نیز یافتن شیوه‌های مناسب‌تر برای درمان آن کاربرد داشته باشند. مواد و روش‌ها: این مطالعه، یک مطالعه‌ی توصیفی- ت...

متن کامل

تشخیص حس وابسته به گویندة گفتار فارسی با استفاده از ویژگی‌های آکوستیکی

بیان احساس در ارتباطات روزمره از جایگاه ویژه‌ای برخوردار است. از جمله بسترهای نمود احساس، گفتار است. از این‌رو، یکی از جنبه‌های مهم در طبیعی‌سازی ارتباط میان انسان و ماشین، تشخیص حس گفتار و تولید بازخورد متناسب با احساس درک‌شده است. باوجود پیشرفت‌های گسترده در حوزة پردازش گفتار، استخراج و درک احساس پنهان در گفتار انسان، همچون خشم، شادی و جز این‌ها، از یک‌سو و تولید گفتار احساسی مناسب از سوی دیگ...

متن کامل

رتبه‌بندی واج‌های گفتار فارسی از نظر کارآیی در بازشناسی گوینده

در این مقاله، کارآیی واجهای گفتار فارسی از نظر بازشناسی گوینده مورد مطالعه و پژوهش قرار گرفته و با توجه به میزان کارآییها، رتبهبندی واجها صورت گرفته‌اند. جهت برآورد کارآیی واجها، از یک معیاری که بهصورت نسب « فاصلة بینگویندهای» واجها به « فاصلة در گوینده‌ای» تعریف شده است و ما آن را « نسبت تأثیرپذیری گوینده » نامیدهایم، استفاده شده است. آزمایشها و محاسبات لازم برای کلیه واجهای گفتار...

متن کامل

منابع من

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

عنوان ژورنال

پردازش علائم و داده ها

دوره 14 شماره 4

صفحات 143- 157

تاریخ انتشار 2018-03

دنبال کردن

لغو دنبال کردن

{@ msg @}

با دنبال کردن یک ژورنال هنگامی که شماره جدید این ژورنال منتشر می شود به شما از طریق ایمیل اطلاع داده می شود.

کلمات کلیدی

کلمات کلیدی برای این مقاله ارائه نشده است

میزبانی شده توسط پلتفرم ابری doprax.com